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I. INTRODUCTION 

bo"; 

>^ , Cosmological perturbation theory has recently enjoyed renewed interest. Linear or first order theory is still a very 

^r^ ' active field of research, though the focus has moved on to higher order and even fully non-linear theory. This is to a 

I large extent due to the availability of much improved data sets: whereas previously linear theory was sufficient and 

C^H- the power spectrum the observable of choice, now the quality and quantity of the data is such, that higher order 

.]j^ \ observables, as for example the bispectrum, can be compared with the theoretical predictions. These new data sets 

come from observations of the Cosmic Microwave Background (CMB) on the one hand, such as the one already in 

CN ' progress by WMAP and in the near future also by PLANCK. But also from 21cm surveys on the other hand, mapping 

^^ , the anisotropics in neutral hydrogen, such as LOFAR, now under construction and SKA, currently in its design phase. 

i__i' Einstein's theory of General Relativity (GR) is highly non-linear, it is therefore difficult to deal with in all but 

r^ , the simplest situations using the full theory. Fortunately for cosmologists the universe appears to the homogeneous 

Qh' and isotropic to a remarkable degree so the Friedmann-Robertson- Walker (FRW) metric is adequate for many 

X. ] purposes. For instance, once known local features are removed, the CMB is isotropic to an accuracy of ST/T of 10~^. 

5^ ■ However if we want greater resolution or more detail then the approximation has to take into account anisotropy 

f/2 ' and inhomogeneity. At present this cannot be done in full generality since we do not have the appropriate exact 

C^ . solutions to Einstein's equations. This is not surprising, given their highly non-linear nature. To deal with this 

problem cosmologists have resorted to perturbation methods, which have proved effective in other areas of physics. 

pq ', Previous relevant works on perturbation theory in cosmology at a linear order include Refs. [ll, H, S, 0, a lO, 0] , and at 

J> ■ second order @, M, SO, \lM, \l2, \lM ■ Beyond linear order the literature tends to be very technical and difficult for the 

^^^ [ non-specialist to follow. In this paper we aim to strike a balance between mathematical rigour using the language and 

C ■ ■ tools of differential geometry, and usability and applicability to the problems of theoretical astrophysics and cosmology. 

CN ■ 

^ ■ , The essential idea behind perturbation theory is very simple, and best illustrated by an example for which we 

^^ ' choose the metric tensor in standard cosmology. We assume that we can approximate the full metric (5^1/) of the 

^^ universe by an expansion 
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The metric gl^J , called the background, is the FRW metric with appropriate spatial curvature, i.e. K — 0,1,-1 
according to the assumptions made about the universe. The remaining terms are the perturbations of the background. 
C^ ' The first order part is given by 

9^.. - 9l?J = 9i'J , (1-2) 

where the remaining terms are assumed to be negligible compared to g},^^ and are neglected at first order. In a similar 
way the higher order perturbations can be identified. This can be described simply if we assume that the series can 
be written as 

9^..=9l?J+e9l!J+e'~gl?J + ... (1.3) 

where the quantities with tildes have absolute magnitudes of order unity, and we assume that e <^ 1. To zeroth order 



we have gf^i, ~ gjiJ and at first order 



9f.u = 9^J + e~gl!J , (1.4) 



and so on using the fact that at each order the higher order terms can be ignored. In practise it is often a nuisance 
to introduce the parameter e so, where appropriate we wiU use the form (jl.ip . Issues of convergence can be removed 
by working within a smaU enough neighbourhood of the background. 

Having set up the approximation p.ip . we have to substitute it into the Einstein equations 

Gp^ + Kg^y = KTf_,^ , (1.5) 

to obtain approximate solutions at the required order of approximation for the apphcation we have in mind. This 
is more difficuh than one might imagine. Firstly, perturbations of the metric imply perturbations of the energy 
momentum tensor, but more importantly, calculation of the connection coefficients and the Ricci tensor involves 
raising and lowering indices and involves terms of different orders. At zero and first order this is not a problem, but 
at higher orders it makes the calculations much more complicated and so the choice of coordinates or form of the 
metric can be important. Already at second order we have "proper" second order terms and terms quadratic in the 
first order quantities. 

Another problem arising in cosmological perturbation theory is the presence of spurious coordinate artefacts or 
gauge modes in the calculations. Although GR is covariant, i.e. manifestly coordinate choice independent, splitting 
variables into a background part and a perturbation is not a covariant procedure, and therefore introduces this 
gauge dependence. Prior to 1980 the gauge modes were handled on a case by case basis, when Bardeen in Ref. [g| 
resolved the issue and provided a systematic procedure for eliminating the gauge freedom at first order. Although it 
is sometimes argued that the covariant approach [141 avoids the issue of gauge choice it corresponds to the comoving 
gauge which is made explicit by the inclusion of the velocity field [l5| . Below we will address the gauge issue in some 
detail and explain how it can be resolved at first and at second order. 

The paper is organised as follows. In the next section we introduce perturbation theory using notation and concepts 
from differential geometry. In particular we discuss the definition of perturbations and how perturbations change under 
small coordinate changes. In Section [1111 we apply the concepts and results of Section [TTl We discuss the construction 
of gauge-invariant variables at first and second order. Amongst the examples discussed is the second order tensor 
perturbation and how it can be rendered gauge- invariant. We discuss our results in Section [Tvl We finish this paper 
with an appendix in which we describe the relevant concepts from differential geometry used in Section [TT[ 

We predominantly use conformal time, rj, related to coordinate time t hy dt = adrj, where a is the scale factor. 
Derivatives with respect to conformal time are denoted by a dash. Greek indices, /i, v, A, run from 0, ... 3, upper case 
Latin indices, A, B, C run from 0, ... ,4, while lower case Latin indices, i, j. A:, run from 1, ... 3. 

II. PERTURBATION THEORY 

In this section we introduce perturbation theory using differential geometry. Though focusing on cosmology, we 
keep the discussion general. After giving a definition of perturbations we introduce and define the concept of gauge 
and study how perturbations change under gauge transformations. The relevant definitions from differential geometry 
are discussed in Appendix A. 

A. Cosmological perturbation theory: perturbations of space-time 

The application of perturbation methods in space-time brings in a new problem, since among the physical 
quantities to be perturbed is the space-time itself. Also, because the results will be used in Relativistic Cosmology, 
the theory and results must be covariant. These requirement lead us to base our discussion on the explicitly 
coordinate independent description of Sachs [l| , Stewart and Walker [2] and Stewart [j] . In this case by coordinate 
independent we mean that the description docs not require coordinates and so is intrinsically covariant. We can 
and do introduce coordinates to do calculations and simplify the exposition. An alternative and widely used, but 
coordinate dependent, description is given in Ref. [j]. As one would expect the results are the same although, in 
our view, it is easier to understand and see the source of the final equations in the description of Ref. [3| . We will 
however describe both procedures and show the connection between the two approaches. 

We now follow Stewart [3| closely and consider a one parameter family of 4-manifolds Me embedded in a 5-manifold 
M. Each manifold in the family represents a perturbed space-time with the base or unperturbed space-time manifold 
represented by Mq. We define a point identification map P^ : Ado — > Me which identifies points in the unperturbed 
manifolds with points in the perturbed manifold. This correspondence specifies a vector field X upon Af. This field 




FIG. 1: The vector field X^ generates a point map between the manifolds Mo and A^j 
(pe between coordinate neighbourhoods on the manifolds. 



This in turn yields a diffeomorphism 



is transverse to TWe at all points. The points which lie on the same integral curve 7 of X are to be regarded as the 
same point, see Fig. [TJ This can be expressed in terms of coordinates. Choose coordinates x'^ on Mq and extend 
them to M by requiring that x^ = constant along each of the curves 7. This induces coordinates {x"^ = {x^,e)} with 
A = 0,1, ..,4: and fi, v, .. = 0, 1, .., 3 on A/". We parametrise the curves 7 by e and so dx^ /de = X^ and we choose the 
scaling of e such that 



Mn^M, 



(2.1) 



In this way the vector field X generates a one to one, invertible, differentiablc mapping between A^o and A^e, i.e. a 
one-parameter group of diffeomorphisms and it follows that (j)^(l)g — 0c+e- In particular the inverse map from A^^ to 
A4o will be denoted, in an obvious notation, by 4>~^ = 0_e and the identity map is given by 0e=o- 

Given a geometric quantity T defined on J\f the simplest way to produce a perturbation expansion of T is to expand 
it as a Taylor series along 7. This yields a covariant power series for T along the curve. To first order the series has 
the form 0] 



,T, = To + e {£xT) 



O U' 



(2.2) 



where the 0, is used to indicate that the quantity is the puUback, i.e. it is T^ evaluated at the point where e = 0. Lie 
derivatives are used instead of partial derivatives so that the series is covariant. For reasons that will become obvious 
later it is convenient that the series is pulled back to A4o (see Eq. (jA20[) ). At higher orders the Taylor expansion is 
given by 3 
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(2.3) 



where we note again that ^^Te is evaluated on M.q. The expansion automatically provides the covariant perturbation 
expansion we want. Each term in the series is proportional to a power of e. The first term Tq is proportional to e", 

the background value, the next term e (£xT) 

term is given by ^ {£^T) 
The expansion Eq, 



is proportional to e to the first order and so on, and the n order 



can be written in a compact and useful form using the exponential operator, 



(2.4) 



Here (/)*Te is the perturbed value of T pulled back to A/(o and so the perturbed value of T is given by 

(p^ST^ = 0*Tc - To , 



where we note that we could not have done the subtraction if we had not puUed T^ back to A^q. In an ahcrnative 
notation, commonly used in the literature, we include the e with the T and write 



where 



T = Tn + 5T, 



5T = Ti + ^T2 + ^n 



(2.5) 



(2.6) 



with T„ - e" i£3cT)o- 

In Ref. [41] the approach is as follows. On a single space-time manifold M with coordinates a;^ define a background 
model by assigning to all geometric fields Q a fixed background value '■"'Q, which is not itself a geometric quantity, 
at each point on the manifold. While the fields Q may transform as scalar, vector or tensor fields we require that the 
(°)(3 be fixed functions of the coordinates. Under a coordinates transformation the ^^^Q will have the same functional 
dependence on the new coordinates as they had on the old ones. A perturbation is then given by 



SQ = Q 



(0) 



Q. 



(2.7) 



To relate the two approaches we can think of the ^^'Q quantity playing the role of a quantity defined on AAq 
in the Stewart description and the coordinate change corresponding to a change of coordinates on A^^. But it is 
important to note that the approach of Ref. [4| only one manifold is necessary. The Stewart approach [3| avoids the 
need for the quantity ^^'Q, which is not covariant and gives a simple diagrammatic representation at the price of 
having to introduce the abstract 5-dimensional manifold A/". However, note that it is the split into a background and 
a perturbation which in general is not covariant. This split is common to both approaches and it gives rise to the 
gauge dependence. 

B. Gauge Transformations 

Gauge is arguably the most over-used word in mathematics and physics. Sometimes the meanings are related but 
often they are not and it is a waste of time trying to relate them. To avoid confusion we recommend that the word 
"gauge" as used here is interpreted as defined and not related to other uses of the word. The choice of correspondence 
between points on Mq with those on Ai^ or, equivalently, the choice of a vector field X is a gauge choice. The vector 
field X is called the generator of the gauge. 

Let us now turn to defining gauge dependence in a clearer way. Consider a point p in Mq a-nd the generators X 
and Y corresponding to two different gauge choices (see Fig. ^. The choice X will identify point p on Mq with a 
point q on A4^ and will assign to q the same x^ coordinates as at point p. On the other hand the gauge choice Y will 
identify p with a different point u on Ale assigning in its turn the coordinates of p to u. Clearly the choice of gauge 
induces a coordinate change (a gauge transformation) on A/(c. This interpretation is called the passive view Ref. [j,l8|. 





FIG. 2: On the left panel, the passive view: The point p on the manifold A^o is mapped to two different points q and u on AIe 
depending on the choice of gauge, corresponding to the choice of vector field, we make. On the right panel, the active view: 
the points p and q on Mo both map to the point u on Ale. Again the choice of gauge determines the mapping. The vector 
fields generate the gauge choice. A change in gauge from X^ to Y^ produces a gauge transformation. 



FIG. 3: For the active point of view of a gauge transformation we choose a point on A4e and determine the points p and q on 
A^o which map to u under the gauge choices V'e and cjie- The map $£ which maps the point p to the point q is then formed by 
first mapping p to it using the map tp^ and then mapping u to g using the map (^_e. Thus $e = (/>_£ o i/j^. 



For the active view we choose a point u on Aie and find the point p on A^o which maps to u under the gauge choice 
X and the point q, also on Mo, which maps to u under the gauge choice Y, sec Fig. |3l The gauge transformation 
this time is defined on Mo and takes the coordinates of q to those oi p in one of the two choices of gauge. 

In summary as we shall explain in more detail below, in the active approach the transformation of the perturbed 
quantities is evaluated at the same coordinate point, whereas in the passive approach the transformation is taken at 
the same physical point. 

In the passive approach of Ref. [J] the role of the background manifold is played by the background quantities 
^°^Q and the coordinate transformation corresponding to the gauge choice only affects the geometric quantities Q. 
The perturbation is the difference between ^^'Q and Q so only half the quantities determining the perturbation are 
transformed by the gauge transformation. 

The gauge dependence in perturbation theory stems from the fact that we separate quantities into a background and 
a perturbed part, a operation not covariant in general, which introduces additional, unphysical degrees of freedom. 
However, as shown below in Section IIIIl by choosing and combining suitable matter and metric variables the gauge 
dependencies can be made to cancel out (the quantities so constructed will not change under a gauge transformation). 
This process is equivalent to choosing suitable physical hypcrsurfaccs, say comoving or of uniform curvature ^ . 

1. Active point of view 

To take the argument further we will now focus on the active interpretation of the gauge transformation. Corre- 
sponding to the gauge choice X, i.e. the choice of the vector field X transverse to Mo we have a diffeomorphism (f)^ 
where (j)^ : Mo — > Me and corresponding to the vector field Y we have a diffeomorphism ipe ■ Mo -^ Me- For all e 
these two vector fields induce a diffeomorphism (gauge transformation) $e on Mo given by, see Fig. [3j 

<i>e:Mo^Mo, (2.8) 

where $£ is made up of two parts - a map ipe from Mo to Me and a map (/>_£ from Me to A^Oj i-S- 

<i>e:=<P-e0^e. (2.9) 



^ One should not confuse gauge independence in perturbation theory with what is called gauge choice in general relativity which arises 
from coordinate invarianco: the Bianchi identities introduce four additional degrees of freedom into the Einstein equations. This allows 
to always choose four free functions in the metric, i.e. four particular coordinate functions, that might simplify the problem under 
consideration if suitably chosen. 



Under this gauge transformation the transformation of a geometric quantity T is given by (see appendix Eq. (jA16|) 
and Eq. dSHI) 

$„T = (</)_, oV'.),r (2.10) 

= V«o (/.,_, T (2.11) 

__ p[<^£i>x) ^(-'^^•t'Y) T (2.12) 

where we have used the fact that the pull-backs of the transformations induced by the gauge choices can be written 
as Taylor series in terms of the exponential notation as 

0,^r = e'^fxT, (2.13) 

^,^r = e'^^xT, (2.14) 

(for more details again see the Appendix Eq. (jA16p to Eq. (jAlSp ). Also note that the T here have to be evaluated 
on A4o but putting Tq would be confusing. 

Now we invoke the Baker-Campbell-Haussdorf formula \1&\ which enables us to write $*eT in the following form 

$.er = expl^-£5 It, (2.15) 

where 

^i=Y-X, ^2 = [X,Yi and ^3 = ^[X + Y,[X,Y]] , 

and X and Y are the gauge generators, i.e. the vectors which determine the gauge choices. Explicitly the first few 
terms of the gauge transformation Eq. (|2.15p are 

+ ll (% + I [%, %] + £l) T\o + 0{e^)) , (2.16) 

where we indicate that T has to be evaluated on the manifold A^o by the notation T . 

If we now use the equation (|2.5|) to introduce the results of Taylor expanding T into the formula Eq. (|2.16p we 
obtain 

To — To , 

Ti = Ti + i^^iTo , 

±2 = T2 + i:4,To + 4^To + 2i:^,Ti, (2.17) 

where ^^ is the vector field generating the transformation and ^^ = e^J^ + ^£^^2 + 0{e^). 

Similarly the map (|2.16p generated by <f> enables us to relate two coordinate systems {U,x) — > {U',x) (see Fig.[T]) 
under an infinitesimal transformation generated by e^'^. In the active view this transformation takes the point p with 
coordinates x^(p) to the point q = ^e{p) with coordinates x^{q). Note that in the active view it is the points that 
change. Applying the map (|2.15p it follows 

a;^(g) = e^^^lp cc^(p), (2.18) 

where we have used the fact that when acting on scalars £^ ~ ^'^ S- and the partial derivatives are evaluated at p. ^ 
The left-hand-side and the right-hand-side of Eq. (|2.18p are evaluated at different points. Equation (|2.18p can then 
be expanded up to second-order as 

x^{q) = x^{p) + ee,{p) + ^e^ [e^M^i^iP) + e2{p)\ ■ (2.19) 



^ Note, that if we had used a minus sign in the exponent of Eq. 1)2. ISp the signs in the equation relating the coordinates in the passive 
approach, Eq. 1 12. 231 1 below, would conform to those usually found in the literature. 



Note that we do not need Eq. (|2.19p to calculate how perturbations change under a gauge transformation in the active 
approach, it simply tells us how the coordinates of the points p and q are related in this approach. 

2. Passive point of view 

In the passive approach we specify the relation between two coordinate systems directly, and then calculate the 
change in the metric and matter variables when changing from one system to the other. As long as the two coordinate 
systems are related through a small perturbation, the functional form relating them is quite arbitrary. However, in 
order to make contact with the active approach, discussed above, we take Eq. (|2.19p as our starting point. 

Note, that all quantities in the passive approach are evaluated at the same physical point. To take the passive 
approach further, we therefore need to rewrite the left-hand-side and the right-hand-side of Eq. (|2.19p . since they are 
evaluated at two different coordinate points, as described above (see also Fig. [5]). We choose p and q to be points 
such, that the coordinates of q in the new coordinates are the same as the coordinates of p in the old coordinates, 
i.e. xt^{q) ~ x^^ip), then use Eq. ()2.19p to derive 



xt^{q) = xf'ip) 



x^iq) - ee^ixip)) ~ le' klMpmU^ip)) + ^^i^ip)) ■ (2.20) 



Using the first terms of Eq. (j2.19p we have 

x^(g)=a;^(p) + 6Cr(p), (2.21) 

to get a Taylor expansion for ^^, 

er(p) = er(^^(9)-<r(p)) 

= ?r('?)-<r(9),.?i'^(?), (2.22) 

where in the very last term we have replaced S,i'^{p) by ^i'^{q), the correction being of third order. Substituting 
Eq. (|2.22p into Eq. ()2.20p finally gives the desired result, namely a relation between the "old" (untilded) and the 
"new" (tilde) coordinate systems, 



X^{q) = x'^iq) - e^f (g) - e'- [^^{q) - ^^iq)J,''{q)\ , (2.23) 

all evaluated at the same point q. 

III. APPLICATIONS 

As an application and illustration of the above we now derive the transformation behaviour under gauge transfor- 
mations of some quantities at first and second order. We start at first order by highlighting the two different points of 
view in how the vector fields inducing the coordinate change affect the perturbations, as detailed above in Section FlIBI 

Before studying the transformation behaviour of the perturbations, we define and relate them to their respective 
backgrounds in the following. As our first example we choose a four-scalar, we use here the energy density p, which 
can be expanded up to second order using Eq. (j2.5p 

p^ Po + Spi + -Sp2, (3.1) 

where we already split Sp into its first and second order parts according to Eq. ()2.6p , the subscripts denoting the order 
of the perturbations. 

Our second example is given by the metric tensor 5^,^, as outlined in Eq. p.ip . In particular, using Eq. (j2.5p . the 
complete Friedmann-Robertson- Walker metric tensor, up to and including second-order perturbations, can be written 
as 



goo = -a' (1 + 201 + ^2) , (3.2) 

go^ = a" (bu + ]^B2i\ , (3.3) 

5,j = a" [% + 2Ciy + C2y] , (3.4) 



where we assumed a flat (A' = 0) background. 

The first and second order perturbations Bn and Cuj, and B2i and C2ij, can be further split according to Eqs. 
and (|3.6p below into scalar, vector and tensor parts (defined according to their transformation behaviour on spatial 
3-hypersurfaces) , 

B, = B^,~S,, (3.5) 

Gij = --0 7,j +£^,jj +F(,j) + -/ly . (3.6) 

where the vector parts, Si and Fi^ are divergence free, and the tensor part, hij is divergence free and traceless, i.e. 

^\, =0, F\^Q, h%^0, h\^0. (3.7) 

The order of the perturbations in Eqs. (|3.5p . (|3.6p . p.7p and has been omitted in the above for ease of presentation. 
Note that -0 is the curvature perturbation, describing the intrinsic scalar curvature of spatial hypersurfaces. Further- 
more (j) is the lapse function, hij the tensor perturbation describing the gravitational wave content, B and E describe 
the scalar shear, and Si and Fi the vector part of the shear. 

Here and in the following we assume a flat background without loss of generality, just simplifying our calculations 
and allowing us to use partial derivatives in expressions such as Eq. (|3.6p . 

The perturbations are decomposed into scalar, vector, and tensor parts since at linear order the governing equations 
for the different types decouple. This is however no longer the case at higher orders, and indeed we already see from 
the gauge-transformations and the definitions of gauge-invariant variables at second order that e.g. the energy density 
(a scalar quantity) on flat hypersurfaces now also contains first order vector and tensor parts, see Eq. ()3.53p . 

Finally, we should point out that the decomposition of the metric tensor in Eq. (|3.2p is not unique. This is already 
evident in the temporal part of the metric tensor, where the lapse function </> is here simply expanded in a power series, 
(j) = (j)i + ^(f>2 + . . .. Alternatively we could have expanded exp((^) into a power series, this obviously doesn't affect 
the physics. More importantly, also the decomposition of the spatial part of the metric tensor, that is Eq. (|3.6p is not 
unique. Indeed, other decompositions are in use and can be just as useful or better, depending on the circumstances 
and the application intended. For example it can be useful instead of expanding ij: in Eq. (j3.6p directly into a power 
series, to expand e"^ (see e.g. Ref. [I4I, and for a relation of the two expansions Ref. [IS]). 

A. Passive point of view 

The passive point of view is very popular at first order, see e.g. the original paper by Bardeen [5[, the review by 
Kodama and Sasaki [6| , and the one by Mukhanov, Feldman, and Brandenberger [^ . 

The starting point in the passive approach is to identify an invariant quantity, that allows to relate quantities to 
be evaluated in the two coordinate systems. We denote the two coordinate systems by x^ and x^ system, and their 
relation is given by Eq. (j2.23p . We choose as an example the energy density, p, which as a four scalar won't change 
(however, once it has been split into different orders, it will change). Another invariant is the line clement ds^, which 
allows to study the transformation properties of the metric tensor, by exploiting the invariance of rfs^, i.e., 

ds^ = g^udx^dx'^ — g^vdx'^dx^ , (3-8) 

which we here will not pursue, but see e.g. P.[l3|. 

Turning instead to the energy density as an illustrative example, we get the transformation behaviour of the 
perturbation from the requirement that it has to invariant under a change of coordinate system and therefore has to 
be the same in the x^^ and the x^^ system, that is 

p{x>^) = p{xn . (3.9) 

To first order, the two coordinate systems are related, using the linear part of Eq. (|2.23p . by 

i" = a;" - (1 ■ (3.10) 

Before we can study the transformation behaviour of the perturbations at first order, we split the generating vector 
^^ into a scalar temporal part ai and a spatial scalar and vector part, /3i and Ji, according to 

Cr = (ai,/3i.'+7i% (3-11) 



where the vector part is divergence- free dk^^ = 0. Then expanding the left-hand side of Eq. p.9p . we get neglecting 
terms of O(e^), 

= p{xn~p,xe 

= po + Spi - p'oai , (3.12) 



and similarly expanding the right-hand side of Eq. p.9|) , we have 

p{xn = poixn + spiixn ■ (3.13) 

Finally, since by assumption /5o(a;^) = pa{x^), we get 

8pi = 5pi+ p'^ai. (3.14) 

Note that all quantities are evaluated at the same physical point. 

B. Active point of view 

We now turn to the active point of view when calculating the effect of gauge transformations on perturbations. 
Here, as detailed in Section IIIBI above, one actively maps the perturbed quantities from one manifold to another. 
The relation of the coordinate systems on the two manifolds is also induced by the map. 

At first order the preference of which approach to use is a question of taste, and as pointed out above most first 
order papers use passive view point, but see e.g. [20| for first order active calculation. However, at second order we 
found the active view point easier to implement, and it is used in many other second order works, e.g. Refs. |8l.l9l. Illf. 

1. First order 

As in the passive view section above, we start with the energy density. It follows immediately from Eq. (|2.17|1 that 
to first order a scalar quantity such as the energy density transforms as 

5pi^ 5pi+ p'^ai. (3.15) 

The transformations of the first order metric perturbations also follow from Eq. (|2.17p . We then find that the 
metric tensor transforms at first order, as 

555 = <5e^ + gfU' + gf^e, ,. + 9TM ,. (3.16) 

As another example we now turn to the spatial part of the metric tensor. Note that Eq. (|2.17|) gives only the 
transformation of the total spatial part of the metric, Cy. If we then ask how the components of Cy transform, we 
have to use Eq. (|3.7p . 

To get the change of the metric functions in the spatial part of the metric under a gauge transformation, we get 
the transformation of the spatial part of the metric 5gl, , and hence Cuj , from Eq. p.l6p as 

2Ciij = 2Ciij + 2HaiSij + £,uj + ^y^j , (3-17) 

where we reproduce Eq. (j3.6p above for convenience at first order, 

2Cuj - -2^1% + 2Si,„- + 2i^i(,j) + huj . (3.18) 

Taking the trace of Eq. (j3.17p and substituting in Eq. p.lSp we get 

- 3V^i -I- V'^Ei = -3V'i + V^Ei + iUai + V^/^i . (3.19) 

Now applying the operator d'^d'-' to Eq. p.l7p we get a second equation relating the scalar perturbation 01 and Ei, 

- 3V2V'i + V^V^i^i = -3VVi + V^V^Si + iJiV'^ai + V^V^/?! . (3.20) 



Taking the divergence of Eq. (|3.17p we get 

2C^^/ = 2C^^l + 2Hai,, + V^iu + y^Pl,^ ■ (3.21) 

Substituting in our results for ipi and Ei we the arrive at 

V^Fh = V^Fi, + VV • (3.22) 

We can sum up the wcU known transformations of the first order metric perturbations we have from the above, 
first for the scalars as (e.g.[20|) 

J[ = (j)^+ Hai + a[ , (3.23) 

^1 = Vi - Wai , (3.24) 

Bi = Bi-ai+l3[, (3.25) 

El = Ei+Pi, (3.26) 

where Ti. = a' /a, and for the vector perturbations as 

S^ = ^1^-71^', (3.27) 

F^' = i^i'+7i'- (3.28) 
The first order tensor perturbation is found to be gauge- invariant, 

hiij = huj . (3.29) 

by substituting Eqs. (|3.23p to (|3.28p into Eq. (|3.17p . This can also be understood from the Stewart- Walker lemma 
[2|: at first order, quantities that are identically zero in the background are manifestly gauge- invariant, and there is 
no tensor part in the background. However, as we shall see below, this only works for quantities at the next higher 
order: for example the second order tensor perturbations will in general not be gauge- invariant. 

2. Constructing gauge-invariant variables at first order 

To construct a gauge-invariant quantity, say the energy density on flat slices, that is hypersurfaces on which tpi = 0, 
we see from Eq. p.24[) that this gives 

ai = |i . (3.30) 

All we need to do next is to substitute Eq. (j3.30p into Eq. (j3.15p . and get a gauge- invariant in the sense of being 
independent of gauge artifacts, for example the energy density on flat slices 



Spi 



^ =Spi + ^^Pi. (3.31) 

flat n 



This is gauge-invariant in the ^'^-independence sense, but it does depend on the choice of background (e.g. a background 
depending on a;* instead of just time as in FRW would obviously give a very different result). This works for all the 
perturbations and also at second order and higher. 

We conclude the above example by observing that to remove the gauge modes on sub-horizon scales, often referred 
to as specifying the threading, we can choose Ei = 0, which gives 

(3i = -El . (3.32) 

For the vector modes we choose F^ = 0, which gives 

7i' = -Fi . (3.33) 

Hence in this gauge the spatial part of the perturbed metric is zero with the exemption of the tensor-modes. 



3. Second order 

At second order the generating vector ^2 is split into a scalar time and scalar and vector spatial part, similarly as 
at first order, 



C^ 



(a2,/32 



-72 



(3.34) 



where the vector part is divergence-free dt^^ = 0. We then find from Eqs. ()2.17p that a four scalar transforms at 
second order 



5p2 = 5p2 + p'oa2 + oii{p'^ai+ p'f^ai +26p[) 



(3.35) 



We see here already the coupling between vector and scalar perturbations in the last term through the gradient and 
^i . The gauge is only specified once the scalar temporal gauge perturbations at first and second order, ai and a2, 
and the first order spatial gauge perturbations, j3i and 7^*, are specified. 



The metric tensor transforms at second order, from Eq. (j2.17p as 



.(2) _ XJ2) 



JO) ^A 



J")cA 



(O)eA 



,(l)eA 



,(l)cA 



hlV = K'^ + C,^ + <i^ . + 5l. ^2 ., + 2 -55-,^!^ + -5.9-^1^ ,. + Sg^'JCt , 



,(0) 



-,(0) fA 



(0) 



,(0) 



f-a (^A 






„(0).A ^a 



(0) fc\ 



+ff^A (^1 ,yaCl + ^1 ,q'?1, i/j + 9\i, (Cl ,^aCl + Cl ,aCl, /i) 



(3.36) 



Now following similar lines as at first order in the previous section, we could get the transformation behaviour for the 
second order lapse function 02 straight from the — 0-component of Eq. p.36p . 

Instead, to keep the discussion as brief as possible, we now turn to the transformation behaviour of the perturbations 
in the spatial part of the metric tensor. Here we can follow a similar procedure as in the linear case. But, the task 
is made more complicated not only by the size of the expressions but more importantly by the fact that now we will 
have to let inverse Laplacians operate on products, in order to get the transformations of the scalar, vector, and tensor 
parts of the spatial metric. 

Using Eq. (j3.36p we find that the perturbed spatial part of the metric, C2ij, transforms at second order as 



2C. 



2ij 



2C2ij + 27ia2% + 6 



2i.j 



-e 



2 J, I 



Xi. 



(3.37) 



where we defined Xij to contain the terms quadratic in the first order perturbations as 



Xij — 2 



a 
a 



n^ + —\aj+n {aia[ + ai,fcCi 



+4 



ai {C[,^ + 2HCu,) + CujMi'" + CufcCi'' ,, + Cik.^i'' ,,] + 2 (Bi,aij + Bij-ai,,;) 



+S.liMl 



■ iljMl 



■Cuai,3 +Ci,ai,i- 



(3.38) 



Note that in Eq. (|3.38p above and in the following wc will not decompose the spatial part of Eq. (|3.1ip . ^l = [3^ ' + 7i*, 
whenever convenient to keep the presentation as compact as possible. 

The perturbed spatial part of the metric, C'2ij, is decomposed in Eq. (j3.6p above into scalar, vector, and tensor 
part, which we reproduce here at second order. 



2C2,j = -2^2% + 2S2,y + 2F2(,j-) 
Taking the trace of Eq. (|3.37p and substituting in Eq. (|3.39p we get 



l2ij 



(3.39) 



3V'2 



y^E2 



-3V'2 + V2£;2 + 3Ha2 + V2/32 + -X' 



k ' 



(3.40) 



where we find X\ to be 



+2 [a, (C,'^ ,' + 2HC,'' fe) + C," ,,Ci' + ^C.^'Uk] + 2B,ka,^ (3.41) 

Now applying the operator d'^d^ to Eq. p.37p we get a second equation relating the scalar perturbations 1/^2 and i?2, 

- V^i'2 + V^V^i^s = -VV2 + ^^V^E2 + HV^as + V^V^/Ss + i^-'^'^^j , (3.42) 

This gives for the transformations of the curvature perturbation at second order, 

V;^ = V^2 - na2 - \x% + \v-''r\,^ , (3.43) 



and for the shear scalar, 



E2^E2+ 132 + tV-^v-^a"^' . - -V-^X\ ■ (3.44) 

4 . J 4 



Taking the divergence of Eq. p.37p we get 

2C^^P = 2C2,/ + 2na2., + V^^2^ + V2/32,, + X^^'^ . (3.45) 

Substituting in our results for ip2 and E2 we then arrive at 

V2i?2. = V^F2^ + V^^12^ + A'.fc, '^ - V-^;^'^'^^;, . (3.46) 

Finally 

^2. = ^2. + 72. + V-^A-,,^ '= - V-^V-^A-^',,,; . (3.47) 

We finally turn to the tensor perturbation at second order. Substituting our previous results for tp2, E2, and F2i 
into Eq. (|3.37p we get, probably surprisingly, 

h2rj = h2^, + X,, + ^ {V'^ X^\^i - X\) % + \v-^V-^ X^\^i,^ 

+ ^V-2A'\,^. - V-2 (A',,_ '^ ^ + A;-,,^ ,;) . (3.48) 



Although the second-order tensor transformation /i2y is not dependent on the second-order gauge- functions ^2 : it 
does depend on first order quantities quadratically. 

The same holds for other quantities that are zero in the background: the first order quantity is gauge-invariant by 
virtue of the Stewart- Walker lemma [3| (and by construction) . However the second order quantity is no longer gauge- 
invariant, as shown above in the case of the tensor perturbation, h2ij. This is not a violation of the Stewart- Walker 
lemma, it merely shows that the second order quantities "live" in a first order "background" . Another example is the 
anisotropic stress, which is gauge-invariant at first order, but not at second. 

^. Constructing gauge-invariant variables at second order 

We can now construct, just as at first order in Section lilIB21 gauge-invariant variables at second order. We choose 
the same example as in the previous section, namely the energy density on flat hypersurfaces. But now also give the 
second order tensor perturbation in this gauge. 

To specify the gauge at second order we choose hypersurfaces on which V'2 = 0, and we see from Eq. (|3.43p that 
this gives 



_ ^'2 , 1 



Y7-2 yij -yk 

^ ^fia.t,i] '^flatfc 



(3.49) 



where we get Xfi^tij from Eq. p.38p using the first order gauge generators given above, as 



-^fli 



a.t ij 



V-i ( Ij- + 2V^i ) + ^i.^eU 



S^, + -i'l{C[,,+2HCu,) 



n 



+4Ciy,fc^lfijjt + (4Cljfc + 6fiati,fc) Clflatj- + (4Cljfe + 6flatj,fe) C 



Iflat,; 



2 



V>i,, (2Bi, + eiflay) + ^1 J (2Si, + em.,,) - TT^^i^.^i J 



^2 



where we defined 



The trace of Eq. ((XSD)) is then 



■^flatfc — 6 



^l (?ifiat(i,i) + 47i^lflat(ij) j + 2Ciflat'?lflat(ij)fc + 26flatfc,i^lflat j" : 



Clflati = - (-El,, + Fu) 



(3.50) 
(3.51) 



^-1(^ + 2^1)+^!,^?^ 



H 



4'i{cf, + 2nc^, 



+4Ci fc,iClflat + 4 (2Ci + ^iflat. ) ^lflat(fc,0 - 2V ^l.fcClflat 



(3.52) 



Then substituting at first order Eqs. (f?^ . (f?^ . and (f?^ . and at second order Eq. ((^^ into Eq. ((3351), we 
get the second order energy density perturbation on uniform curvature hypersurfaces [lO| 



Sp: 



2flat 



SP2 + ^^'2 + ^ (v-^A-^Im, - -flat. 



A-^^ 



+ 



H' 



n 



p'^iJi + p'o[iA-—ibi]+2nSp[ +l2Sp, + !^^^\ e 



Pn. 



n 



(3.53) 



5 Iflat 



,k 



The second order tensor perturbation in that the flat gauge, i.e. on uniform curvature hypersurfaces, is given by 
substituting Eqs. ()3.30p . (j3.32|) . and (j3.33|l . into Eq. p.48|) . and we find after some algebra 



^2flaty - h2ij + Xnatij + 2 (^ -^flat.fci " -^flatfe) ^ij + 2^ ^ '^fiiit,kUj 



1. 



— 2 v^' 



-v-^x, 



fiat k.ij 



- V-' {X, 



a&tik^j ' tiatjk.i 



Xa 



k 



(3.54) 



IV. DISCUSSION AND CONCLUSIONS 



This is neither the first, nor will it be the last, discussion of perturbation theory in cosmology. However, in this 
concise introduction we have tried to strike a balance between mathematical rigour and ease of application of the 
results. For a more detailed exposition of cosmological perturbation theory and further references see Ref. [2l|. 

We have here studied perturbations about a flat FRW background spacetime. But as pointed out above, the 
formalism introduced by Bardeen can easily be applied to other settings and background spacetimes, and can also 
be extended beyond GR. Indeed, perturbation theory and the formalism discussed in this paper can be applied 
to all covariant metric theories. Although here we have assumed standard four dimensional (4D) Einstein gravity 
throughout, the formalism has also been applied, for example, to 5D braneworld models (see e.g. Ref. [2^ for an 
overview), and has been used to construct gauge-invariant variables in that theory. 

Whereas most of the material discussed in the previous sections has been expounded elsewhere, albeit often in 
different form and with other aims, we are not aware of the derivation of how the second order tensor perturbations 
transforms in full generality under gauge-transformations being discussed elsewhere (see however Ref. [231 for the case 
of scalar perturbations). Also its representation in the uniform curvature gauge has been discussed for the first time. 
These results will be of particular interest in second order calculations of the gravitational wave background [24| . The 
transformations at second order of the decomposed components of the spatial parts of the metric have also not been 
discussed in the literature before, and will be particularly useful in relating quantities calculated in different gauges. 
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APPENDIX A: DEFINITIONS 

In this appendix we bring together some key ideas and definitions from differential geometry, in particular those 
regarding maps of manifolds, which are useful in setting up perturbation theory in general relativity where covariance 
matters. This appendix is not a comprehensive study as our intention is only to show, intuitively, why certain 
expressions take the form they do and to define some key ideas. In order to be useful to those who do not wish 
to enter into the formalism of differential geometry we will use coordinate expressions where possible. For those 
who worry about these things we will therefore be working in coordinate neighbourhoods and all functions will be 
assumed to be adequately differentiable as they are for most of cosmology. For more details and a less coordinate 
dependent approach the reader is referred to the books by Hawking and Ellis [25| and by Wald [2^ . 

• Maps between Manifolds 

We assume some familiarity with the definition and properties of differential manifolds and are concerned here 
with maps between such manifolds, in particular a diffeomorphism. Consider two manifolds M and yVte and 
denote the chart maps (coordinates) on each by 

fa-Oa -^ Ua for Oa C X , C/^ C R" , 

gb-.Ob -^ Ub for Ob C Me, C/b C M" . (Al) 

In words the function fa assigns coordinates (in Ua) to points in the n-dimcnsional neighbourhood Oa of Ai 
and Qb does the same in the neighbourhood Ob of A^^. 

The map 

(l):M^Me, (A2) 

is C°° , i.e. infinitely continuously differentiable in the advanced calculus sense, if for each a and b the map 

faO(^-^og;^:Ub^Ua, (A3) 

is C°°. The map is a diffeomorphism if it is one-to-one, onto and (p and its inverse (p^^ are C°° . Loosely 
speaking, for coordinates {x* on Ua and y* on Ub }, Eq. (jA3[) relating the coordinates under the map (j) can be 
represented by the equations 

y^ = r i^n , (A4) 

with inverse 

x'^ ^ {cf,-'r{yn , (A5) 

where </>'' and (0^^)^ are C°°. 

• Maps of Vectors induced by mapping manifolds 

A manifold map (f) : Ai ^ A4e induces a map of the tangent vectors at p on 7W to tangent vectors at (j){p) on 
Me- We will write this map as (j)* : Vp —>■ V0(p) where Vp and V0(p) denote the tangent spaces at p and 4>{p). 
Using the coordinate description above, the map 0* can be written as 



and if X'' e Vp and Y" e V^(^p) then 



(A6) 



hX: X'^-^Y^^, or Y^^ ^ {c^yi X" ^ ^ X"" , (A7) 



in the usual notation. The map 0* is sometimes called a pushforward. 



• Pullback 

The concept of a pullback is well described by its name. If we are given a function / defined on a manifold TV 
so 

/ : AA ^ M" , (A8) 

and a manifold map 

<j):M^N, (A9) 

then (j) can be used to pullback f from A/" to A^ by using the composite map f o (j). This pulls / back from J\f 
to map M to M", i.e. 

fo(j>:M^W. (AlO) 

This is perhaps easier to follow if viewed as a sequence: the (j> maps A^ to A/" then / maps Af to R" or, in 
symbols, 

fo(j):M — >f:N — >M". (All) 

The map / o is the pullback of /. 

• Maps of covectors induced by mapping manifolds 

For covectors (forms or covariant vectors) the map between manifolds </> : A/I ^ A4e induces a pullback map 0* 
which takes covectors at (j){p) on A4e to covectors at p on A/(, i.e., 

ck* ■■ W^0(p) -* Wp , (A12) 

where W^0(p) denotes the cotangent space at (l>{p) on A4^ and Wp is the cotangent space at p on Ai. This can 
be written in coordinates, by letting X^ e Wp and YJ^ G T^0(p), and we can write 

The mapping of covariant and contravariant tensors of higher rank follows the same pattern as that for the 
vectors and covectors. Tensors with mixed co- and contravariant indices also follow this pattern, although 
proving it requires a little care (see [2y| p. 438). A pullback on a vector field effectively reverses the effect of 
a pushforward and so the definitions of pullback and pushforward act to preserve the scalar property of the 
product of a vector and a covector. 

• Pullback of a Composite Map 

Note that for composites of maps the pullback behaves in a different way to the push forward. For pushforward 
maps, 

0:A/I^7V, and ip : Af ^ V , (A14) 

the composite is simply 

ijo(j):M^V. (A15) 

For the corresponding pullback maps we have 

(f>,:Af^M, i)^:V-^N, (A16) 

(V' o 0), : P ^ M, (A17) 

but, and this is important, if we remove the bracket we have to reverse the maps 

0* o '0* : P ^ A/( , (A18) 

i.e. {ip o 4>)* — (j)* o ijj* as can be seen by looking at the actions of the maps themselves. 



• Lie Derivative 

Let A^ be a differential manifold and let ^ be a vector field on A4 , then ^ generates a map of M onto itself as 
follows. In a coordinate neighbourhood solve the system of ordinary differential equations 

in M". Given initial points in A^ at e = this will generate a unique one parameter family of integral curves 
with one and only one curve through each point in a neighbourhood. For an integral curve 7 let 4>eip) be the 
point a distance e along 7 from p. Applied to the family of curves (jje generates a one parameter family of 
diffeomorphisms of M onto itself. Clearly (/)ej o cj)^.^ = 0^^ + cj)^^ and (j)o is the identity map. 

Now let T be a tensor field on A4 then the puUback 0* of (p defines a new tensor field 0*eT on A^ which is a 
function of e. This enables us to define the Lie Derivative, a covariant differentiation on M, which does not 
increase the rank of the tensor. 

£fT := lim - ((/)„ T - T) , (A20) 

This can be interpreted as follows - the map 0*£ pulls back the value of T at 0£ (p) to p from which we subtract 
the actual value of T at p. This difference is a well defined tensor quantity since the difference is taken at the 
point p. We can now take the limit to obtain a meaningful derivative at p - called the Lie Derivative. 

• Exponential operator 

Given an operator A the exponential operator is defined by the formal series 

e'^:=l + A+^A' + J2(^)^"- (A21) 

n=3 \""/ 

Thus, for instance, if A = e£^ is the Lie Derivative operator multiplied by e the corresponding exponential 
operator is given by 

2 °o / n\ 

= l + e£, + -4 + J2(-)£-,. (A22) 



e 



n— 3 



[1] 

[2] 
[3] 
[4] 
[5] 
[6] 
[7 
[8 
[9 

[lo; 

[11 
[12 

[13 

[14 
[15 
[16 
[17 
[18 
[19 
[20 
[21 
[22' 
[23^ 
[24' 
[25^ 



R. K. Sachs, in Relativity, Groups and Topology, ed C. De Witt and B. De Witt, Gordon Breach, New York (1964). 

J. M. Stewart and M. Walker, Proc. Roy. Soc. Lond. A 341, 49 (1974). 

J. M. Stewart, Class. Quant. Grav. 7, 1169 (1990). 

V. F. Mukhanov, H. A. Feldman and R. H. Brandenberger, Phys. Rept. 215, 203 (1992). 

J. M. Bardeen, Phys. Rev. D 22, 1882 (1980). 

H. Kodama and M. Sasaki, Prog. Theor. Phys. Suppl. 78, 1 (1984). 

R. Durrer, Fund. Cosmic Phys. 15, 209 (1994) [arXiv:astro-ph/931 1041] . 



M. Bruni, S. Matarrese, S. Mollerach and S. Sonego, Class. Quant. Grav. 14, 2585 (1997) 'arXiv:gr- qc/9609040l . 



V. F. Mukhanov, L. R. W. Abramo and R. H. Brandenberger, Phys. R ev. Lett. 78, 1624 (1997) ar Xiv:gr-qc/9609026 
K. A. Malik and D. Wands, Class. Quant. Grav. 21, L65 (2004) jarXiv:astro-ph/0307055j . 
H. Noh and J. c. Hwang, Phys. Rev. D 69, 104011 (2004) 



N. Bartolo, E. Komatsu, S. Matarrese and A. Riotto, Phys. Rept. 402, 103 (2004) [arXiv:astro-ph/0406398 
K. Nakamura, Prog. Theor. Phys. 117, 17 (2007) arXiv:g r-qc/0605108, . 

G. F. R. Ellis and M. Bruni, Phys. Rev. D 40, 1804 (1989). 

D. Langlois and F. Vernizzi, Phys. Rev. D 72, 103501 (2005) 'arXiv:astro-ph/0509078l . 



C. F. Sopuerta, M. Bruni and L. Gualtieri, Phys. Rev. D 70, 064002 (2004) arXiv:gr- qc/0306027 

D. S. Salopek and J. R. Bond, Phys. Rev. D 42, 3936 (1990). 

D. H. Lyth, K. A. Malik and M. Sasaki, JCAP 0505, 004 (2005) arXiv:astro-ph/0411220] . 



K. A. Malik, arXiv:astro-ph/0101563 



K. A. Malik and D. Wands, JCAP 0502, 007 (2005) [arXiv:astro-ph/0411703l 

K. A. Malik and D. Wands, in preparation (2008). 

R. Maartens, Living Rev. Rel. 7, 7 (2004) arXiv:gr-qc/0312059'. 

C. Pitrou, Class. Quant. Grav. 24, 6127 (2007) arXiv:0706.4383 [gr-qc]]. 



K. N. Ananda, C. Clarkson and D. Wands, Phys. Rev. D 75, 123518 (2007) arXiv:gr-qc/0612013 . 

S. W. Hawking and G. F. R. Ellis, The Large Scale Structure on Space-Time, Cambridge Univ. Press., Cambridge, UK 
(1973). 
[26] R. M. Wald, General Relativity, Univ Chicage Press, Chicago, USA (1984). 



